Vezf1: 1 to 3640



CGGGG GGAGTGGGGAGGAGGGGGGTCGGCCGCCGCAGCCATGGAGGCCAACTGGACCGCGTTCCTGTTCC^^ 90

MEANWTAFLFQAHEASH 17

caccaacagcaggcagcgcagaacagcttgctgcccctcctgagtto 180 

HQQQAAQNSLLPLLSSAVEPPDQKPLLPIP 47

ATTACTCAGAAACCTCAGGCTGCACCAGAAACATTAAAGGATGCCATTGGGATTAAAAAAGAAAAACCCAAAACTTCGTTTGTGTGCACT 270
ITQKPQAAPETLKDAIGIKKEKPKTSFVCT 77

TACTGCAGTAAAGCATTCAGGGACAGCTATCACCTGLAGGCGC 360
YCS^AFRDgyBEfiRHQSCB TGIKLVSRAKK 107 

ACCCCCACCACGGTGGTTCCCCTTATCTCCACCATTGCTGGGGACAGCAGCCGAACTTCGTTGGTTTCAACTATTGCAGGCATCTTGTCA 450 
TPTTVVPLISTIAGDSSRTSLVSTIAGILS 137

ACAGTCACTACATCTTCCTCGGGCACCAACCCCAGCAGCAGCGCTAGTACCACAGCAATGCCTGTGCCCCAGTCTGTCAAG 540 
TVTTSSSGTNPSSSASTTAMPVPQSVKKPS 167

AAGCCTGTCAAGAAGAACCACGCCTGTGAGATGTGTGGGAAGGCOT 630 
KPVKKNHA CEMCgKAFRDVYHT l N RHKI, fi TT S 197 

GACGAAAAGCCCTTTGAGTGTCCTATTTGTAATC^ 720 
D E K P F E CPXCNQRFKRKDRM T YHVRSg EGG 227 

ATCACCAAACCCTATACTTGCAGTGTTTGTGGGAAAGGCTTC^ 810 
I T K P X T CgVCGKGFSRPDHr, SCgVKBVfl S T 257 

GAAAGACCCTTCAAATGCCAAACGTGCACTGCTGCCTTTGCCACCAAAG^ 900 
E R P F K COTCTAAFATKDRLRT HMVRg E G K V 287 

TCATGTAACATCTGTGGGAAGCTCTTGAGTGCAGCATATATCACCAGCCACTTAA^ 990
SCNICGKLLSAAYITSHLKTHGQSQSINCN 317

ACGTGCAAACAAGGCATCAGCAAAACGTGCATGAGTGAG^ 1080 
TCKQGISKTCMSEETSHQKQQQQQQQQQQQ 347 

* *********** 

CAACAACAACAACATGTGACAAGCTGGCCAGGGAAGCAGGTAGAGACACTGAGACTGTGGGAAGAAGCTGTCAAAGCAAGAAAGAAAGAA 1170 
QQQQHVTSWPGKQVETLRLWEEAVKARKKE 377
* * * * 

GCTGCCAACCTGTGCCAAACCTCCACGGCTGCTACGACACCAGTGACTCTCACTACTCCATTCAATATAACGTCCTCTGTGTCGTCTGGG 1260
AANLCQTSTAATTPVTLTTPFNITSSVSSG 407

ACTATGTCAAACCCAGTCACAGTGGCAGCTGCAATGAG 1350
TMSNPVTVAAAMSMRSPVNVSSAVNITSPL 437

GCCATGACCTCACCTTTAACACTCACCACCCCAGTCAACCTrc 1440 
AMTSPLTLTTPVNLPTPVTAPVNIAHPVTI 467

ACATCTCCAATGAACCTGCCCACTCCTATGACATTAGCTGCCCCTCTCAATATAGCAATGAGGCCTG 1530 
TSPMNLPTPMTLAAPLNIAMRPVESMPFLP 497

CAAGCTTTGCCTACGTCACCGCCTTGGTAAACAGTATTATAAGTC^^ 1620 
QALPTSPPW 506 

GATTAAAGCAAAAAGCAGACTATGAAATTGGGAGGTTTTATTATGTTAGTTAATAAGAGTGTAGTAGCTCCAATTTTGCTGGGGTTGTTC 1710

AAAGTAGGGTATATGTGTAACl^ATCACTGGACCACTTTAGTTTACTCAGAAACCCCTTTAGCTGACACCATTGCTTAAACAGG 1800 

GCTGGCAAGACGAAATGGCAGAATTAAAACCAATCATAAAAC^ 1890 

ACAACAGAATCATTTTATTGTAAACACTAGCAGAG 1980 

GAGAGCTAGTGTAGCATTGTCTGTGGTTTTGCTCGTATGAGTGAACAGAGGCATTGTCATAATAAA^ 2070 

TACCTTTGGGAATATGTTAATTTCAGGCAGCATTCCCTA 2160 

TCTAACTTCAACATATAATAGGGATTGTGACCT 2250 

GTAGTTGACTTTTTAAAAAAAAAAAAACAAAAAAAAAAAAAAGGAATTC 2340 

GGTGGCTTCAGAACCCAGGAAGTGGCCAAGGGGCACAGACTCTGCTGGAGGCCTGAGCCGGG 2430 

TTTGCCTTAGGCCACAAAAAGAAGAAGGCTACCCCACTTACAGATGCAGACCATGTGGGGCTCCGGAGAACTGCTTGTAGCATGGTTTCT 2520

AGTGTTGGCAGCAGATGGTACTACTGAGCA1GTCTACAGACCCA.GTCGTAGAGTGTGTA 2610 

GACCTTTGTGCAGCGTGTATACCAGCCTTACCTCACCACTTX5CGACGGACACAGAGCCTGCAGCACCTACCGAACCATCTACCGGACTGC 2700 

CTATCGCCGTAGCCCTGGGGTGACTCCCGCAAGGCC^ 2790 

TGGAGCAGGAATATGCCAGCCTCCATGTGGGAATGGAGGGAGT^ 2880 

CTTGCCAGACAGATGTTGATGAATGCAGTACAGGAGAGGCCAGTTGTCCC^ 2970 

GATGGGAGGGACAAAGCCCATCTGCAGATGGGACCCCCTGCCTGTCTAAGGAGGGGCCCTCCCCTTTC 3060 

TGG ACAGCATGGCGAGAGAGGAGGTGTACAGGCTGCAGGCTCGGGTTG^ 3150 

ACAGCCTGGCCTCTCGGTCCAGAGAGCATG^ 3240 

AGTCCCACAAGACACCACTCCACCCACAGAGAGCCTAGGGGACCCATGGGGTGGACACCAGGGCTGGGTGGATGGAACTTCTTCTGGGAT 3330 

GGGCAGATTTGCAAGTTTACACCTTTTTTCCTCTC^ 3420 

TAGAGCAGTATCCAGATCCCTTGTAGCCAGAGTTCAGGGACGCTGTC 3510 

TCTTAGGAGGTTCCTAGGACTTGGGTATGGGGAGTGGGGTC 3600 

TGTCTCCATCTTTGTCATAATAAAGCTGAGACTTGAAAAAAAAAA 3645 



Human DB1 DNA and Protein Sequences:

10 20 30 40 50 60 

AGCGGGGGGAGTGGGGAGGAGGGGGGTCGGCCGCCGCAGCCATGGAGGCCAACTGGACCG 

M E A N W T> 

70 80 90 100 110 120 

CGTTCCTGTTCCAGGCCCATGAAGCTTCCCATCACCAACAGCAGGCAGCACAGAACAGCT 
AFLFQAHEASHHQQQAAQNS> 

130 140 150 160 170 180 

TGCTGCCCCTCCTGAGCTCTGCCGTGGAGCCCCCTGATCAGAAACCATTGCTTCCAATAC 
LLPLLSSAVEPPDQKPLLPI> 

190 200 210 220 230 240 

CAATAACTCAGAAACCTCAGGGTGCACCAGAAACATTAAAGGATGCCATTGGGATTAAAA 
PITQKPQGAPETLKDAIGIK> 

250 260 270 280 290 300 

AAGAAAAACCCAAAACTTCATTTGTGTGCACTTACTGCAGTAAAGCTTTCAGGGACAGCT 
KEKPKTSFVCTYCSKAFRDS> 

310 320 330 340 350 360 

ATCACCTGAGGCGCCACGAATCCTGCCACACAGGGATCAAGTTGGTGTCCCGGCCAAAGA 
YHLRRHESCHTGIKLVSRPK> 

370 380 390 400 410 420 

AAACCCCCACCACGGTGGTTCCCCTTATCTCTACCATCGCTGGGGACAGCAGCCGAACTT 
KTPTTVVPLISTIAGDSSR T> 

430 440 450 460 470 480 

CGTTGGTCTCGACCATTGCAGGCATCTTGTCAACAGTCACTACATCTTCCTCGGGCACCA 
SLVSTIAGILSTVTTSSSGT> 

490 500 510 520 530 540 

ACCCCAGTAGCAGTGCCAGCACCACAGCTATGCCAGTGACCCAGTCTGTCAAGAAACCCA 
NPSSSASTTAMPVTQSVKKP> 

550 560 570 580 590 600 

GTAAGCCTGTCAAGAAGAACCATGCTTGTGAGATGTGTGGGAAGGCCTTCCGAGATGTGT 
SKPVKKNHACEMCGKAFRD V> 

610 620 630 640 650 660 

ACCATCTCAATCGACACAAGCTCTCCCATTCAGATGAGAAACCCTTTGAGTGTCCTATTT 
YHLNRHKLSHSDEKPFECP I> 

670 680 690 700 710 720 

GTAATCAGCGCTTCAAGAGGAAGGACCGGATGACTTACCATGTGAGGTCTCATGAAGGAG 
CNQRFKRKDRMTYHVRSHEG>



730 740 750 760 770 780 

GCATCACCAAACCCTATACTTGCAGTGTTTGTGGGAAAGGCTTCTCAAGGCCTGACCACT
GITKPYTCSVCGKGFSRPDH> 




790 800 810 820 830 840 

TAAGCTGTCATGTAAAACATGTCCATTCAACAGAAAGACCCTTCAAATGCCAAACGTGCA 
LSCHVKHVHSTERPFKCQTC>

850 860 870 880 890 900 

CTGCTGCCTTTGCCACCAAAGACAGACTGCGGACACACATGGTGCGCCATGAAGGCAAGG 
TAAFATKDRLRTHMVRHEG K> 

910 920 930 940 950 960 

TATCATGTAACATCTGTGGGAAGCTCCTGAGTGCAGCATACATCACCAGCCACTTAAAGA 
VSCNICGKLLSAAYITSHL K> 

970 980 990 1000 1010 1020 

CTCATGGGCAGAGCCAAAGTATCAACTGTAATACATGTAAACAAGGCATCAGTAAAACAT 
THGQSQSINCNTCKQGISKT> 

1030 1040 1050 1060 1070 1080 

GCATGAGTGAAGAGACCAGTAACCAAAAGCAGCAGCAGCAGCAGCAGCAGCAACAACAAC 
CMSEETSNQKQQQQQQQQQQ> 

1090 1100 1110 1120 1130 1140 

AACAACAACATGTGACAAGCTGGCCAGGGAAGCAAGTAGAAACACTCAGACTGTGGGAAG 
QQQHVTSWPGKQVETLRLWE> 

1150 1160 1170 1180 1190 1200 

AAGCTGTTAAAGCAAGGAAGAAAGAAGCTGCTAACCTGTGCCAAACCTCCACGGCTGCTA 
EAVKARKKEAANLCQTSTA A> 

1210 1220 1230 1240 1250 1260 

CGACACCTGTGACTCTCACTACTCCATTCAGTATAACATCCTCTGTGTCGTCTGAGACTA 
TTPVTLTTPFSITSSVSSET> 

1270 1280 1290 1300 1310 1320 

TGTCAAACCCAGTCACAGTGGCAGCTGCAATGAGCATGAGAAGTCCAGTAAATGTTTCAA 
MSNPVTVAAAMSMRS PVNVS> 

1330 1340 1350 1360 1370 1380 

GTGCAGTTAACATAACCAGCCCAATGAACATAGGGCATCCTGTAACTATAACCAGTCCAT 
SAVNITSPMNIGHPVTITSP> 

1390 1400 1410 1420 1430 1440 

TATCCATGACCTCTCCTTTAACACTCACTACCCCAGTCAACCTCCCCACCCCCGTCACTG 
LSMTS PLTLTTPVNLPTPV T> 

1450 1460 1470 1480 1490 1500 

CCCCAGTGAATATAGCACACCCTGTCACCATCACATCTCCAATGAATCTACCCACACCTA 
APVNIAHPVTITSPMNLPTP> 

1510 1520 1530 1540 1550 1560 

TGACATTAGCCGCCCCTCTCAATATAGCAATGAGACCTGTAGAGAGCATGCCTTTCTTGC 
MTLAAPLNIAMRPVESMPF L> 




1570 1580 1590 1600 1610 1620 

CCCAAGCTTTGCCTACATCACCGCCTTGGTAAACAGTATTATAAAATCAAAATATGGGTA
PQALPTSPPW*> 

1630 1640 1650 1660 1670 1680 

AAAGTAAATATTTACCAGCAACTTAACTTTTAGTTGATTAAAGCAAAAAGTAAACCATGA 

1690 1700 1710 1720 1730 1740 

AATTGGGAGATTTTATTACATTAGTTAATAAGAGTGTGGTAGCATTTTTCTCCAATTTGG 

1750 1760 1770 1780 1790 1800 

CTGGGATTATTCAAAGTAGGGTGTGTATGTAACTTATCACTGGACCACTTTAGTTTAATC 

1810 1820 1830 1840 1850 1860 

AGAAATTCCTTTTAGCTGACAACATTGCTTAAACAGGATAGTAGTTGGCAAGATGAAATG 

1870 1880 1890 1900 1910 1920 

CCAGAATTAAAACCAATCATAAGTAGAACCCACTTCAAAATAAAAAAACAGCATTACTAT 

1930 1940 1950 1960 1970 1980 

TTCTAATCCCAAGGAATCACTTTATTGTAAACACTAGCAGAACTCTTCTCCCTATACAAG 

1990 2000 2010 2020 2030 2040 

GTGGATGGCTGATTTTAACCTGAAATTTTAAATCCACAGATTGAGAGCTAGTGTAGAATT 

2050 2060 2070 2080 2090 2100 

GTCTGTGTTTATTGTTTTTATGAGTAAATACATGCATTGTCATAATAAAATGCATTTCAG 

2110 2120 2130 2140 2150 2160 

AGAATATGCATTTTACCTTTGGGAATATGTTAATTTCAGGCAGCATTCCCTATGGGAAAG 

2170 2180 2190 2200 2210 2220 

GTGATACCAGCTCTGATATGCAAAGCATATGATAATTTATCATTCTAACTTCAACGTATA 

2230 2240 2250 2260 2270 2280 

ATAGGGATTGTGACCTGATATTTGGAGATGTAAATATTGCTCAGCATATTAATCCCGATG 



2290 2300 
GAATATAGCATTGTAGTTGACTTTTT 
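
The pairing of DNA and protein lines in the human DB1 listing can be spot-checked by translating from the first ATG with the standard genetic code. The following is a minimal sketch; the function name `translate_from_first_atg` and the variable `first_line` are illustrative and not part of the original listing, and the input is the first 60-nucleotide line of the listing.

```python
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_from_first_atg(dna):
    """Translate complete codons from the first ATG up to a stop codon.

    Hypothetical helper for illustration; it assumes the input contains
    only A, C, G and T (upper or lower case).
    """
    dna = dna.upper()
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for pos in range(start, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the open reading frame
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the human DB1 listing (positions 1 to 60).
first_line = "AGCGGGGGGAGTGGGGAGGAGGGGGGTCGGCCGCCGCAGCCATGGAGGCCAACTGGACCG"
print(translate_from_first_atg(first_line))
```

The printed residues can be compared with the protein line shown under the first DNA line (M E A N W T); a trailing incomplete codon is ignored.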



A. Retroviral 
Entrapment 
Vector 
B. Infection of ES cells
Isolation of G418r clones



C. Screen for AP expression 
in embryoid bodies 
D. Selection of ES cell clones with
differential AP expression in vitro 



E. Aggregation chimeras
Transfer into pseudopregnant
foster females



F. AP expression in 
day-8.5 to 9.5 embryos 




G. Cloning of 

flanking cellular DNA 

by supF complementation 
A: Jackson BSS Chromosome 2



D2Mit6

Abi1 

Pax8 

Vezf1

Abl 

Psmb7 
Acvr2a 



Haplotype counts: 45 34 2 2 1 1 1 1 4 3



R      SE
2.13   1.49
2.13   1.49
2.13   1.49
1.06   1.06
1.06   1.06
7.45   2.71
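
The R and SE values in panel A are consistent with a recombination percentage and its binomial standard error. The sketch below reproduces them under the assumption that the panel comprised 94 backcross animals; that panel size is inferred from the listed values rather than stated in the figure, and the function name `recombination_percent` is illustrative.

```python
import math


def recombination_percent(recombinants, n_animals):
    """Return (R, SE) in percent for one interval.

    R is the fraction of recombinant animals; SE is the binomial
    standard error sqrt(r * (1 - r) / N).
    """
    r = recombinants / n_animals
    se = math.sqrt(r * (1 - r) / n_animals)
    return round(100 * r, 2), round(100 * se, 2)


N_ANIMALS = 94  # assumption: inferred panel size, not stated in the figure

for k in (2, 1, 7):  # recombinant counts that reproduce the listed values
    print(k, recombination_percent(k, N_ANIMALS))
```

Under this assumption, 2, 1 and 7 recombinants give the three distinct R and SE pairs listed in the panel.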



B: Jackson BSS Chromosome 2 



D2Mit1, D2Bir2

D2Mit4, D2Mit31

Jund2, Mpmv3
D2Mit78, D2Mit464
D2Mit6

Abi1

D2Bir5, Pax8

Lhx3, Notch1, Vezf1
Abl, Eng, Pbx3
Psmb7



5 cM



Acvr2a, D2Mit155
Ctsb-rs, D2Mit8



Restriction Enzyme Map of a 20 kb Genomic DNA of the Vezf1 Gene

2kb 




BamHI (B), EcoRI (E), EcoRV (E5), EagI (Ea), NotI (N), PstI (P), SacI (S),
XbaI (Xb), and XhoI (Xh).

Line: Intronic sequence;

Box 1: Exon 1

Box 2: Exon 2



CMV-Vezf1 
Tie2-Vezf1 



Vezf1 EXPRESSION VECTORS



CMV/ 
Tie2 



Vezf1 cDNA
SV40
intron + p(A)



CMV-ΔZF-Vezf1
Tie2-ΔZF-Vezf1



CMV-Vezf1-ZF 
Tie2-Vezf1-ZF 



CMV-AP 
Tie2-AP 



309 992 